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Abstract 

Given a finite poset P, we consider the largest size La(n, P) of a family of 
subsets of [n] := {l,...,n} that contains no (weak) subposet P. This problem 
has been studied intensively in recent years, and it is conjectured that tt(P) := 
limn-Kjo La(n, P)/[\ »i) exists for general posets P, and, moreover, it is an integer. 

For k > 2 let T>k denote the ^-diamond poset {^4 < B\, . . . < C}. We study 
the average number of times a random full chain meets a P-free family, called the 
Lubell function, and use it for P = T>k to determine vr(Pfc) for infinitely many values 
k. A stubborn open problem is to show that tt(T>2) = 2; here we make progress by 
proving ir{T>2) — ^ exists). 

1 Introduction 

We are interested in how large a family of subsets of the n-set [n] := {1, . . . , n) there is 
that avoids a given (weak) subposet P. The foundational result of this sort, Sperner's 
Theorem from 1928 [16], solves this problem for families that contain no two-element 
chain (that is, for antichains), determining that the maximum size is (^nij- For other 
excluded subposets, it is interesting to compare the maximum size of a P-free family to 

We give background and our new results for this study in the next section. One small 
forbidden poset that continues to stymie all interested researchers is the diamond poset on 
four elements. We present a better new upper bound on the size of diamond-free families. 
For /c-diamond-free families for general k, we provide bounds that, surprisingly, turn out 
to be best-possible for infinitely many values of k. 
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In Section 3 we introduce our method for this subject, the Lubell function of a family, 
which gives the average number of times a random full chain meets the family. The Lubell 
function yields an upper bound on the size of a family. For diamond-free families, we 
observe that the maximum possible Lubell function decreases with n, and a calculation 
gives our bound. For excluding /c-diamonds, our new idea is to partition the set of 
full chains, obtaining bounds on each block of the partition. The Lubell and full chain 
partition methods hold promise for other families of forbidden subposets. 

Section 4 contains our detailed proofs, except for the long proof of Theorem 12.4} which 
is given in Section 5. The paper concludes with our ideas for advancing the project. 

2 Background and Main Results 

For posets P = (P, <) and P' = (P', <'), we say P' is a weak subposet of P if there exists 
an injection / : P' — > P that preserves the partial ordering, meaning that whenever u <' v 
in P', we have f(u) < f(v) in P [TF]. Throughout the paper, when we say subposet, we 
mean weak subposet. We say the height h(P) of poset P is the maximum size of any 
chain in P. 

Let the Boolean lattice B n denote the poset (2^, C). We consider collections T C 2™. 
Then T can be viewed as a subposet of B n . If J 7 contains no subposet P, we say J 7 is 
P-free. We are interested in determining the largest size of a P-free family of subsets of 
[n], denoted La(n, P). 

In this notation, Sperner's Theorem [16] gives that La(n, P 2 ) = (, ni), where Vk denotes 
the path poset on k points, usually called a chain of size k. Moreover, Sperner determined 
that the largest antichains in B n are the middle level, Q^ 2 ) (for even n) and either of the 

two middle levels, (/ n I"l/ 2 ) or (( n +i)/2) (^ or n), where for a set S, (^) denotes the 
collection of z-subsets of S. More generally, Erdos solved the case of P^-free families. 
Let us denote by E(n, k) the sum of the k middle binomial coefficients in n, and let 
B(n, k) denote the collection of subsets of [n] of the k middle sizes, that is, the sizes 
[(n-k + 1)/2J ,...,[(n + k- 1)/2J or else the sizes \(n — k+ l)/2] ,...,\(n + k — l)/2] . 
So there are either one or two possible families B(n,k), depending on the parities of n 
and k, and regardless, \B(n, k) \ = E(n, k). Then we have 

Theorem 2.1 f^j/ For n > k — 1 > 1, La(n, Vk) = S(n, k — 1). Moreover, the Vk-free 
families of maximum size in B n are given by B(n, k). 

It follows that for fixed k, La(n,Vk) ~ (k — l)([2j) as n — > oo. Katona and his 
collaborators promoted the problem of investigating La(n, P) for other posets P, especially 
its asymptotic behavior for large n. Consider the r-fork poset V r , which has elements 
A < P>i, ... , B r , r > 2. In 1981 he and Tarjan [13J obtained bounds on La(n, V2) that he 
and DeBonis [1] extended in 2007 to general V r , r > 2, proving that 
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While the lower bound is strictly greater than (i«j)> we see that La(n, V r ) ~ (iSi)- 
Earlier, Thanh [18] had investigated the more general class of broom-like posets. Griggs 
and Lu [H] studied the even more general class of baton posets. These posets mentioned 
so far have Hasse diagrams that are trees. 

In [5] it is shown that for the butterfly poset B, with elements A, B both less than 
C, D, one can give an exact answer, La(n, B) = S(n, 2), for n > 3, which is asymptotic to 
2(pj). More generally, for any s,t > 2, the complete bipartite poset /C s t with elements 

A±, . . . , A s all less than Pi, . . . , B t , satisfies La(n, TC s ,t) ~ 2(^j) [I]. The A/"-poset, with 
elements A, B, C, D such that A < B, C < B, C < D, is intermediate between V2 and B. 
It is shown in [8] that La(n,J\f) ~ (iSj)- 

Based on the examples for which La(n, P) was known, Griggs and Lu [TT] proposed 
the conjecture that was certainly apparent to Katona et al: 

Conjecture 2.2 For every finite poset P , the limit vr(P) := lim^oo L f^"'^ exists and is 
integer. 

All of the examples above agree with the conjecture, and Griggs and Lu verified it 
for additional examples, including tree posets of height 2. For the crown 2 k, k > 2, 
which is the poset of height 2 that is a cycle of length 2k as an undirected graph, they 
extended the butterfly result above and proved that La(n, 2 k) ~ for all even k > 4. 
For odd k > 3, it remains a daunting problem to determine the asymptotic behavior of 
La(n, 02fc)- At least, Griggs and Lu can show La(n, 02fc)/ is asymptotically at most 

1 + 4=, which is less than 2. 

When Griggs lectured on this work on forbidden subposets in 2008, Mike Saks and 
Peter Winkler observed a pattern in all of the examples where n{P) was determined, 
which we describe as follows. For poset P define e(P) to be the maximum m such that 
for all n, the union of the m middle levels B(n,m) does not contain P as a subposet. 
Their observation was vr(P) = e(P). For instance, the middle two levels B(n,2) contain 
no butterfly B, since no two sets of the same size k contain the same two subsets of size 
k — 1. One gets that e(B) = 2, which is tt{B). In general, it is clear that when it exists, 
7r(P) must be at least e(P). 

Impressive progress in the development of the theory is the result of Bukh [2] that for 
any tree poset T (meaning that the Hasse diagram is a tree), ir(T) = e(T), so that the 
conjecture (and observation) are satisfied. It is easily verified that e(T) = h(T) — 1. 

Is there a connection for general P between 7r(P) and the height h(P)l The result of 
DeBonis and Katona for complete bipartite posets K, Stt implies that for any poset P of 
height 2, 7r(P) < 2, when it exists. However, there is no such bound for taller posets, as 
observed by Jiang and Lu (see [H]). Let the fc-diamond poset Vk, k > 2, consist of k + 2 
elements A < B\, . . . , Bj, < C. Then h(T>k) = 3 for general k, while for k = 2 r — 1, the 
middle r + 1 levels B(n, r + 1) cannot contain D^, since an interval in B(n, r + 1) with an 
element in the lowest level and an element in the highest level has at most 2 r elements (a 
subposet B r ), and so at most 2 r — 2 elements in the middle. Hence it is D fc -free. 
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The diamond T> 2 is the most challenging poset on at most four elements in this theory. 
(It is also the Boolean lattice B 2 .) It is easily seen that e(V 2 ) = 2. On the other hand, 
it is a subposet of the path P 4 . So, if 7r(X> 2 ) exists (which has still not been shown), it 
would have to be in [2, 3]; Its conjectured value is 2. 

As an illustration of the Lubell function method introduced in the next section, a short 
application is given that reduces the upper bound on 7r(D 2 ) from 3 to 2.5. A refinement 
of the Lubell function method, which involves partitioning the set of full chains in an 
appropriate way, gives our first improvement on the 2.5 bound: 

Proposition 2.3 For all sufficiently large n, La(n, ^2)/( L nj) < 2.296. 

We display this bound, not our best one, since its proof is simpler than our best bound, 
and since its proof gives us further insight into the Lubell function for T> 2 . Some time 
after we had announced our bound above, Axenovich, Manske, and Martin |T] came up 
with a new approach which improves the upper bound to 2.283. Now using our methods 
with a much more careful analysis of diamond-free families for n < 12, we can provide a 
further slight improvement, which is the best-known upper bound: 

Theorem 2.4 For all sufficiently large n, La(rz, 

^2)/( L Sj) < 2 n + < 2 - 273 - Con- 
sequently, if it exists, tt(V 2 ) G [2,2^-]. 

Because this new bound requires considerably more care, its proof is given in its own 
section following the proofs of our other results. We shall see diamond-free families in the 
proof for which the Lubell function method cannot improve the upper bound on tt(T> 2 ) 
below 2.25. Therefore, new ideas are required to bring the upper bound down to the 
conjectured value of 2. Likewise, it appears that the methods of [TJ cannot move below 
2.25. See the final section of the paper for more discussion of how we can do better. 

Given the great effort that has gone into improving the upper bound on tt(V 2 ), it is 
then quite surprising that we can solve the ir problem for many of the general diamonds 
T>k with k > 2. This can be regarded as our main result. 

Theorem 2.5 Let k > 2, and define m := |~log 2 (/c + 2)] . 

(1) If he [2 m ~ l - l,2 m - ( L ™) - 1], then 

La(n,Pfc) = S(n, m). 

Hence, vr(Pfc) = e(T>k) — m - Moreover, if J 7 attains the bound La(n, T>k), then T = 
B(n, m). 

(2) If k e [2 m - ( L ^j),2 m - 2], then, 



Hence, ifn(V k ) exists, then 



(2 m — k — 1 \ 
m + 1 7^T\ — <m + l. 
HfJ>> / 

For 7r(£> 2 ) this new theorem gives an upper bound of 2.5, not as good as the theorem 
before. However, this new result determines Tr(T) k ) for "most" values of k, in that for the 
2 m ~ 1 values of k in the range [2 m_1 -l, 2 m -2], case (1) applies to all but ( ™) ~ C2" 1 /™ 1 / 2 

of them. Moreover, we are able to give La(n, P fc ) exactly, not just asymptotically for large 
n, for such values of k. 

The poset V k can be viewed as the "suspension" of an independent set of size k, where 
we mean that a maximum and a minimum element are added to it. We can consider a 
more general suspension of disjoint paths (chains). For k > 1 let l\ > ■ ■ ■ > l k > 3, and 
define the harp poset T-L(h, . . . , l k ) to consist of paths V^, . . . , V\ k with their top elements 
identified and their bottom elements identified. For instance, in this notation we have T> k 
is the harp H(3, . . . , 3) where there are k 3's. 

Theorem 2.6 If h > ■ ■ ■ > h > 3, then 

La(n, 7£(Zi, . . . , Z fc )) = E(n, Zi — 1). 

Hence, for such harps, ir — e — l\ — 1 . Moreover, for such harps, if T is a harp-free 
family of subsets of [n] of maximum size, then J 7 is B(n, l± — 1). 

The theorem above only determines vr('H) for harps % that have strictly decreasing 
path lengths. However, for the general case in which path lengths can be equal there 
is no bound independent of k, since we have seen that for T> k , which is a harp, n(V k ) 
is arbitrarily large as k grows. It is then remarkable that we can completely solve the 
problem of maximizing La(n, %) for harps with distinct path lengths. Another novel 
aspect of this result is that for k > 2 the harps it concerns are not ranked posets. 



3 The Lubell Function 

For now let us fix some family T C 2^1 Let C := C n denote the collection of all n\ full 
(maximal) chains C {i\} C {11,12} C ■ • • C [n] in the Boolean lattice B n . A method 
used by Katona et al. involves counting the number of full chains that meet J 7 . Here we 
collect information about the average number of times chains CeC meet J 7 , which can 
be used to give an upper bound on Recall that the height of J 7 , viewed as a poset, is 

hiJ 7 ) := maxlTTlCl. 

cec 

We consider what we call the Lubell function of J 7 , which is 

h(T) = h n m := aveLFnCl. 

cec 
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This is the expected value EQJ 7 fl C|) over a random full chain C in B n . Then ^(J 7 ) is 
essentially the function of J 7 at the heart of Lubell's elegant proof of Sperner's Theorem 
([15]. cf. [7]) with the observation. 

Lemma 3.1 Let T be a collection of subsets of [n). Then ^(J 7 ) = J^FeJ 7 V (|f|) ■ 

Proof: We have that ^(J 7 ) = E(|J r fl C|), where C is picked at random from C. This 
expected value is, in turn, the sum over F G T of the probability that a random C 
contains F. Since C meets the (?) subsets of cardinality k with equal probability, it 
means that each set F contributes 1/ to the sum. □ 

Lubell's proof uses the simple facts that \A D C| < 1 for any antichain A and that 
Q) is maximized by taking = |_fj, to derive Sperner's Theorem that |^4| < (inj)- By 
similar reasoning for general families T we obtain a general upper bound. 

Lemma 3.2 Let T be a collection of subsets of[n}. Ifh{JF) < m, for real number m > 0, 
then \J-~l < 77i(,^j). Moreover, if m is an integer, then |J-"| < E(n, m), and equality holds 

if and only if JF = B(n, m) (when n + m is odd), or if T = B{n, m — 1) together with any 
((n-m)/2) su bsets of sizes (n — m)/2 or (n + m) /2 (when n + m is even). 

Proof: We use the symmetry and strict unimodality of the sequence of binomial coeffi- 
cients (I), < k < n. If h{F) < m, then |F| = Za^ 1 < E A er ( L ?j)/([A|) ^ m i[\\)- 
Now assume m > is an integer. We construct a family J 7 of maximum size, subject to 
h(F) < m, by selecting subsets A that contribute the least to ^(J 7 ), which means that we 
minimize 1/ (mi)- Essentially, we are solving the linear program of maximizing Yli x i(T) 
subject to Y2i x i — m i — x i — 1 f° r a ^ ^ e maximize by selecting J 7 to be the m 
middle levels, B(n,m). 

Further, if \F\ = E(n, m), it must be that J 7 is B(n, m) when n + m is odd. If n + m 
is even, the subsets of sizes (n — m)/2 and (n + m)/2 will tie for the m-th largest size, 
and we can freely choose any L^J subsets of the two sizes so that | = E(n, m). □ 

We see that upper bounds on the average intersection size | J-T[C\ lead to upper bounds 
on the ratio of particular interest in this paper, 1^*1/ ■ Hence, we get upper bounds 

on 7r(P), when it exists, from upper bounds on ft. (J 7 ) for P-free families J 7 . 

To illustrate how this can be useful, we now give a short proof that, if it exists, 
^^P%) — 2.5. Consider a diamond-free family J 7 C 2' n l No full chain C G C n meets J 7 
four times, or else J 7 contains which has T>2 as a subposet. If no chain meets J 7 three 
times, we immediately get ^(J 7 ) < h^J 7 ) < 2. Else, consider any three elements of F 
X C Y C Z, and let Y' be any set not equal to Y such that X G Y' G Z. Let o be 
a permutation of [n] that fixes X and Z and sends Y to K'. Then a sends full chains 
C G C n through X, Y, Z to full chains C G C through X,Y', Z. These chains C meet 
J 7 only twice, as J 7 is diamond-free. These chains C are distinct. We find then that 
h(F) = E(|J"nC|) < 2.5. 
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Unfortunately, the behavior of h^F) does not match that of \J-\/ asymptotically- 

there can be a gap. We shall see examples of this for diamond-free families. Nonetheless, 
in many cases we can obtain vr(P) from h(T). Besides that, it is interesting in its own 
right to maximize ^(J 7 ) for P-free families J 7 , though obtaining a good bound on /^(J 7 ) 
can be difficult. We have discovered that a "partition method" can be fruitful. 

Specifically, we partition the set C of full chains into blocks C(i) and then, for each i 
separately, we bound the average size |J-Tl C\ over full chains C G C{i). The principle is 
that the average size | T n C\ over all full chains C is at most the maximum over i of the 
average over block C(i). An analogy to baseball is helpful for some readers: A hitter's 
average over a whole season is never more than his maximum monthly average over the 
months in the season. 

We illustrate the partition method by sketching a derivation of La(n, B). Let J 7 be a 
butterfly-free family of subsets of [n], n > 3. One can check that if J 7 contains or [n], 
then (J 7 ! < S(n, 2) (although, one may have ^(J 7 ) > 2). Else, suppose 0, [n] J 7 . We 
show ^(J 7 ) < 2. Define the collection M. of subsets M G J 7 of [n] for which there exists 
a chain C G C passing through A, M, B G T with A C M C B. Notice that since J 7 
contains no butterfly B, it contains no P4, and so the collection M. is an antichain. Now 
partition the set of full chains C as follows: For M G M, Cm consists of all full chains 
meeting M, while C® contains all full chains that do not meet Ai. 

By definition of M., no chain in C® meets T three times, and so avecec \F H C\ < 2. 
For M G M, similar to the argument above for IVfree families, for any chain C G Cm 
meeting J 7 three times, it must meet J 7 in A,M,B, and there is a corresponding chain 
in Cm meeting T only at M and avoiding A, B, so that avec g c M l^ 7 H C\ < 2. Hence, 
we have partitioned C into blocks such that J 7 meets chains in each block at most twice, 
on average, and hence at most twice, on average, over all of C. Thus, ^(J 7 ) < 2, and it 
follows that La(n, B) = S(n, 2), since B(n,2) is butterfly- free. 

Regarding the extremal butterfly- free families as far as achieving La(n, B), Lemma [3721 
above applies. In fact, it is known that T must be B{n, 2) for n > 5, though it is not true 
for n = 4: Consider T consisting of {{1}, {2}, {1, 3, 4}, {2, 3, 4}} and all six 2-subsets. 

However, in some cases we can show that La(n, P) is attained only by T = B(n, k): 

Lemma 3.3 Suppose that for poset P, e(P) = m, an integer. Suppose that for all n, all 
P-free families T C 2^ satisfy ^(J 7 ) < m. Then for all n, La(n, P) = m), and if J 7 
is an extremal family, then J 7 = B(n,m). 

Proof: Let J 7 be a P-free family with size La(n, P). According to Lemma |3~2| La(n, P) = 
S(n, m), since ^(J 7 ) < m. Further, J 7 = B(n,m) when n + m is odd. Hence, suppose 
n + m is even, so that J 7 = B(n,m — 1) together with any (( n _^w 2 ) subsets of sizes 
{n — m)/2 or {n + m)/2. Suppose for contradiction that J 7 contains subsets of both 
sizes (n — m)/2 and (n + m)/2. By the natural generalization of Sperner's proof of 
Sperner's Theorem (or by using the normalized matching property on the ranks (/ n J^/ 2 ) 

and (( n ^j)/ 2 )) 0' we can fi 11 ^ subsets A, B G J 7 with A C B, \A\ — (n — m)/2 and 
\B\ = (n + m)/2. Then the interval [A, B] C J 7 . This interval is a Boolean lattice, B m . 
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However, h m (B m ) — m + 1 > m, so that by hypothesis, B m must contain subposet P, 
which contradicts our assumption that J 7 is P-free. Hence, T only contains one of the 
two sizes (n — m)/2 and (n + m)/2. □ 

We saw that the conclusion of the lemma above fails for the butterfly P = B (but only 
for small n). The reason we could not apply this lemma to P = B is that the hypothesis 
fails for n = 2: The full Boolean lattice B2, which has one more element than £>(2, 2), is 
butterfly-free. 



4 Proofs of Results Eft E31 ELS 



We now illustrate our partition method to bring the bound for TVfree families below 2.5. 
Our best bound is derived in the next section. 

Proof of Proposition 12.31 : 

Let J 7 be a V 2 -iree family of subsets of [n] with maximum Lubell function value ^(J 7 ), 
and let d n denote this value. We claim that d n is nonincreasing for n > 2. By easy direct 
case study we get that d 2 = 2.5 and d 3 = rf 4 = 7/3 ~ 2.33. 

For n > 5, if both and [n] are in J 7 , then we have only one more subset in J 7 , and 
^(J 7 ) < 2 + - < 2.2. We will later give examples of families with Lubell function > 2.2, 
so T cannot satisfy this condition. Then we may assume by symmetry that [n] ^ J 7 . 
We partition the set C of full chains into the blocks C n ^, where the chains C G C n ^ pass 
through set [n] \ {i}. A random full chain in C is equally likely to belong to each C nji , and 
h(T) is simply the average over i of the values E(| J 7 D C|) taken over C G C n> i, viewed as 
the Lubell function for the subsets of [n] \ {i}. That is, ^(J 7 ) is the average of n terms, 
each of which is at most d n -\. Hence, d n = hi^J 7 ) < d n -\- 

Returning to the calculations, for d 5 we note that since ^(J 7 ) is a sum of terms, each 
1 or 1/5 or 1/10, d§ is a multiple of 1/10, and hence at most 2.3 since it is at most 7/3. 
Then dj, which is similarly a multiple of 1/105, must be less than 2.3, and hence at most 
241/105 < 2.2953, and so by Lemma E21 for n > 7, \F\ < 2.2953 ("), which implies the 
theorem. □ 

Next is the result for D^-free families for general k. 

Proof of Theorem 12.51 

Let n, k > 2, and define m := |~log 2 (A: + 2)] . 

For the lower bounds, consider T = B(n,m). We have \B — A\ < m — 1 for any two 
subsets A C B in J 7 . There are at most 2 m_1 — 2 subsets S satisfying A C S C B. Hence 
T is Dfc-free. So m < e(T>k) < vr(Pfc). 

Now we derive the upper bounds. Let J 7 be a largest £Vfree family in B n . We take 
what we call the min-max partition of the set C of full chains in B n according to J 7 : For 
subsets A C B C [n] with A,B G J 7 , the block Ca,b consists of the full chains C such that 
the smallest and the largest subsets in J 7 n C are A and B, respectively. We denote by 
Cq the block of full chains that do not meet J 7 at all. For C G C®, we have | J 7 fl C\ =0. 
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We now bound the expected size of |J-TlC| for a random chain C in Ca.b- If \B — A\ < 
m — 1, then this is at most m immediately. 

For the remainder, assume \B — A\ > m. We use the Lubell function Lemma [3.11 to 
calculate E(| J 7 D C|) by adding the contributions of each subset S G J 7 fl [A, B], which is 
1/ (mZ^j) • Since T is Pfc-free and contains both A and B, there are at most k — 1 subsets 
S G J-f] [A, B] besides A and B. Then E(|J-"nC|) is maximized if we take the k — 1 terms 
with largest contribution, z.e., with minimum (jfl^J), which means the sets S closest to 
the ends A or B, so with IS* — A\ equal to 1 or \B — A\ — 1, then 2 or \B — A\ — 2, and so 
on. The contribution from each full level we include is then one. 

For the case (1), where k — 1 < 2 m — 2 — (^mj) , we see that for \B — A\ = m, the k — 1 

terms are at most enough to account for all subsets S G [A, B] with \S — A\ not equal to 
|_m/2j, that is, we get Lubell function at most m (when we include the terms for A and 
B). For \B — A\ > m, since the levels working up from A or down from B are larger, the 
k — 1 terms are no longer sufficient to cover as many full levels, and the Lubell function 
is strictly less than m. Since every block in our partition has expected value at most m, 
we conclude that h(T) < m. Lemma 13.21 gives us \J- \ < S(n, m). Furthermore, we also 
have e{T>k) > m. Hence m = e{T>k) = vr(Dfc) < ^(J 7 ) < m. By Lemma [3.31 the extremal 
family J 7 must in fact be B(n,m). 

For the case (2), where k — 1 > 2 m — 2 — (^mj), we see that for \B — A\ = m, the 

largest sum of k — 1 terms leads to Lubell function at most m + 1 — (2 m — k — 1)/ (^m,) . 

As in case (1), if \B — A\ > m, then since the levels working up from the bottom or down 
from the top in [A, B] are larger, the Lubell function is strictly less than this bound. The 
bound holds for every block Ca,b of the min-max partition. Therefore, if n(T>k) exists, 
ir(V k ) <h{T) <m + l-{2 m -k-l)/{ L ™). □ 

Now we use the min-max partition of the set of full chains to prove the Harp Theorem. 

Proof of Theorem 12.61 

We argue that h^J 7 ) < l\ — 1 for any TL(li, . . . , Zfc)-free J 7 using induction on k. The 
case k = 1 concerns a family J 7 that contains no chain of height 1%, for which we get 
immediately that ^(J 7 ) = EdJ 7 fl C\) < maxc {J 7 D C\ < h — 1 (which implies Erdos's 
Theorem ED). 

Let k > 2, and assume the bound on h for harps with k — 1 paths. Let T be an "H-free 
family of subsets of [n], where H = . . . and consider a block Ca,b in the min-max 
partition of the set of full chains C induced by J 7 . Let t be the largest height of any chain 
in 7 n [A, B). If t < h we get that for full chains C in this block, E(| 7 n C\) < h - 1. 

Otherwise, t > l x . Consider a largest chain Z in J 7 fl (A, JB), say Si C • • • C S t _ 2 ? 
where A C Si and S t _ 2 C B. Let J 7 ' be J 7 fl [A, £>] with the sets in Z removed. 

Then Z and J 7 ' are disjoint and E(J 7 flC) for random full chains C in this block is the 
sum of E(Z n C) and E(J 7 ' n C). For the Z term, by Lemma Owe get $~\ < 
(t — 2)/|S — j4| < 1. For the other term, we observe that J 7 ' is "H(/2, • • • , Zfc)-free in the 
Boolean lattice of subsets of [A, B}. By induction on k, hi^J 7 ') < ^2 — 1 < ^1 — 2. So C meets 
J 7 ' on average at most l\ — 2 times. Combining terms, we find that C meets J 7 at most 
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lx — 1 times on average for C in this block, and hence for all random full chains C G C. We 
have that h(F) <l\ — l. By Lemma I3T21 we get that La(n, %(7i, . . . ,4)) < S(n, l\ — 1). 

The family B(n, l\ — 1) achieves the upper bound just given, since it does not contain 
an /i-chain, and is thus . . . , Zfc)-free. We see that 

e(n(h,...,l k ))=ir(U(!i,...,l k )) = h-l- 



Moreover, by Lemma |3.3[ the only harp-free family of maximum size is B(n, l\ — 1). □ 



5 Proof of Do Theorem 2.4 



We investigate the structure of T> 2 -free families with maximum Lubell function, and use 
this information to improve our earlier bound. Before proving Theorem I2.4[ we continue 
from the proof of Proposition 12.31 in the last section, assuming all notation and facts from 
that. 

We adopt the notation that for any families T\, . . . , T m of sets, T\ V • • • V T m denotes 
the family {Fi U • • • U F m | ViFj G J-j}. Given disjoint sets S,T we define the following 
three constructions: 

Construction d(S,T): 7 = {0} U (f) U ((f) V Q)) U Q. 

Construction C 2 (S,T): JF = {0} U (f) U Q U ((f) V ®) U ((f) V g)). 

Construction C 3 (5,T): J 7 = (W) U (f) U © U ((f) V Q) U ((f) V ©). 

We will typically partition [n] into subsets 5, T in using these constructions, and we write 
Ci(s,n — s) for Cj([s], [n] \ [s]), for integers s, < s < n. The families above are V 2 - 
free and each h(Ci(s } n — s)) = 2 + ^(n-i) • -^ or n — 2, the maximum value over s is 

2 + rw ffi n /) 2J > 2 - 25 > achieved by s = \n/2] or [n/2j. 

In our approach the key to proving Theorem 12.41 is to focus on V2-free families J 7 that 
contain 0. Let S n be the maximum value of h(F) for all such families. Definitions give 
that 2 + ^fazffl < $n ^ <^n- Even though we do not obtain the values of d n , we can 
obtain 5 n for n up to 12. This technical information (including the extremal families for 
S n ) makes up the following lemma, which is the hard part in proving the Theorem. 

Lemma 5.1 The sequence {5 n } satisfies the following properties. 

(1) It is nonincreasing for n > 4. 

(2) For 4 < n < 12, if JF contains and h n {F) > 2^-, then up to relabeling elements 
of[n},TisC 1 {s,n-s)fors^[^\ or [f] ; orC 2 {[%\, \*}). Hence, 5 n = 2 + ™^ . 

Proof of Lemma I5.lt 

To show (1), let J 7 be a V 2 -iree family of subsets of [n] such that ^ n (J r ) = S n . For 
n > 5, if both and [n] are in J 7 , then we have only one more subset in J 7 , and /i n (J r ) < 
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2 + - < 2.2. Thus [n] ^ F. Then similar to the proof of Proposition I2.3[ we partition the 
set C of full chains into the blocks C n> j, where the chains C G C n> i pass through set [n] \ {i}. 
Again, h(F) is the average over i of the values E(| F fl C\) taken over C G C n j, viewed as 
the Lubell function for the subsets of [n] \ {i}. That is, h n (F) is the average of n terms, 
each of which is at most <5 n _i. In other words, for 1 < i < n, let F% — {F G F \ i ^ F}. 
Then each F{ is a X? 2 -free family in 2^^^. We have 



iV-X>-i< 



h n (F) = ave ( ave IJ-TiCI = — > h n AFi). 
i<i<n yceCn,, 1 .. / . 

Hence, £ n = h n (F) < 5 n _i- 

We claim the following two facts which are needed in showing (2). 

Claim 1: The inequality 2 + ^i"/)^ ~ (Jj < 2 u holds for a11 n ^ 12 - 
This can be verified by a simple computation. 

Claim 2: For 4 < n < 12, suppose F C Ci(s,n - s) with h n (F) > 2^. Then F = 
Ci(s,n - s) with s = |"f~| or |_f_|- Similarly, if J 7 C C 2 (s,n - s) with /^(J 7 ) > 2^-, then 
J = C 2 (LfJ, [f 1 )(The same as C 2 ( [f ] , Lf J ) by relabeling the elements). 

One can calculate that if \s — (n — s)\ > 1, then h n (Ci(s,n — s)) < 2^-. Furthermore, if 

[n/2] [n/2\ l 9 _3_ 

n(n-l) (f) ^ Z ll 

if J 7 C C 2 (LfJ, [fl), then ^(J-) < 2 + ^ffl - t^t < 2^. So Claim 2 holds. 



F C n - s) with s = [f 1 or [fj , then h n (F) < 2 + - -X- < 2£. Similarly, 



. 2 J> I 2 '«nv / n(n-l) (™j "11' 

We show (2) by induction on n. When n = 4, it can be directly verified by enumeration. 
There are 17 classes(up to relabeling of elements of [n]) of D 2 -free families containing 0. 
The classes Ci(2,2) and C 2 (2,2) satisfy h(F) = 2| while the rest of them have h(F) at 
most 2 ~ which is less than 2^-. 

Assume n > 5 and the statements are true for n — 1. Now we consider a D 2 -free family 
F C 2^ satisfying h n (F) > 2— and G J 7 . Again, the full set [n] is not in J 7 . Otherwise, 
J 7 contains at most one more subset other than and [n], and h n (F) < 2 + - < 2—. Since 
h n {F) > 2—, there exists i so that h n -i(Fi) > 2^-. We may assume h n -i(F n ) > 2^-. By 
inductive hypothesis, J"„ is dfl/^J, ^1), Ci(^l, L^J), or C 2 (L^J, [^D- We 
consider two cases. 

Case 1: F n = Ci(£,T) where |5| = L^J or [2=1]. 

It remains to decide the subsets in F\F n . Here are two subcases depending on whether 
{n} is in F. 

Subcase la: {n} G F. 

Since F is D 2 -free, it contains no subsets of forms {si, n}, {si, s 2 , n, . . .}, {s\, ti, n, . . .}, 
and {ti, t 2 , n, . . .} for Sj, G S 1 and tj G T. Thus, J 7 C Ci(5 U {n}, T). Since h n (F) > 2—, 
we conclude that (S 1 ) + 1 must be |_f J or [|]. Thus, by relabeling elements of [n] we have 

that dan, Lf j). 

Subcase lb: {n} G^ J 7 . 
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Let S' = {s G S | {s,n} G J 7 } and V = {f G T \ {t',n} G J 7 }. Since J= is 
V 2 -free, J 7 cannot have subsets of forms {s±, s 2 , n, . . .}, {si,ti,n, . . .}, {ti,f, n, . . .}, and 
{ti,h, t3, ■ ■ •} for Si G S, ti G T, and i' G T". Equivalently, 

T C U ((?) V {{n}}) U ((V) V {{n}}) U (( T f) V {{-}})• 



Then 



4 ' - 1 » o J o © 

n U 

(IT-T'I\ l T ,, 

Here /(|T'|) = v A 7 + -U4 is a quadratic function of \T'\ defined on the integer points 
UJ UJ 

of the interval [0, |T|]. Its maximum is reached at one of the two ends, namely \T'\ = 
or |T'| = |T|. 

Claim 3: If h{F) > 2^, then \T'\ = \T\. 

For n = 5, we have \S\ = \T\ = 2. If \T'\ < \T\, then /(|T'|) < max{/(0), /(|T| -1)} = 

^. Thus, ^(J 7 ) < |s||r|+ U |) < 2^, which contradicts our assumption, 

and so \T'\ = \T\. 

For n = 6, we have either |S| = 2 and \T\ = 3, or else \S\ = 3, |T| = 2. If |T'| < 
|T|, then /(|T'|) < max{/(0),/(|T| - 1)} = £ for (|5|,|T|) = (2,3), and /(|T'|) < 
max{/(0), /(|T| - 1)} = ^ for (\S\, \T\) = (3, 2). By direct computation, both cases give 

h(J-) < 1 + 7f + ■ + < 2^-, which is again a contradiction. 

" UJ 

For 7 < n < 12, both /(0) and /(|T| - 1) are at most 7^-7^. Thus, 

UJ UJ 

- h m<1 , 1^1 , igim + (■?) + m + m 1 
M } - - (2) "G) 

< fi B (C' 1 (|S|,|r| + 1))-^ 



[n/2] Ln/2J J_ 3 
" + n(n-l) g) < ir 

This contradiction again proves |T'| = |T|, and completes the proof of Claim 3. 

Hence, ( T f ) V {{n}} is a null family. Namely, J 7 C Ci(S,TU {n}). By the condition 
h n (J-) > 2^-, we conclude by relabeling elements of [n] that J 7 = Ci(|_§J, [f]), which is 
one of the listed possibilities in (2). 

Case 2: T n = C 2 {S,T) where \S\ = [^\ and |T| = [2=1]. 
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We determine what are the possible subsets in J 7 \ T n . Consider the two subcases 
depending on whether {n} G J 7 . 
Subcase 2a: {n} G T . 

Since T is V 2 -fTee, T cannot contain subsets of forms {si, s 2 ,n, . . .}, {ti, t 2 , n, . . .}, and 
{u, v, w,n, . . .} for si, s 2 G S, ti,t 2 G T and u,v,w G [n]. Let 5" = {s G 5 1 {s, n} G J 7 } 
and T' = {* G T \ {t, n} G J 7 }. Then 

^ G T n U {{n}} U ((f) V {{n}}) U ((f) V {{n}}) U (( 5 f) V (T) V {W}). 

We have 

\sun (\s\-\s'\)(\T\-\r\ 

/-rt \ 



G) (a) 
= h n (C 2 (\S\ + l,\T\))-b, 

where we see that b : = -I + + (^™^d^] is a bilinear function of 

(I S' |, |T'|) defined on [0, j^l] x [0, |T|]. To find the extremal values of b it suffices to check 
the four corner points (0,0), (|S],0), (0, |T|), (\S\, \T\). We find the minimum value of b, 

at(|S|,|T|),is^p>^. Hence, 



'9' 



ftnl ^ j - + n(n-l) g) 11' 

which contradicts our assumption, so this subcase is impossible. 
Subcase 2b: {n} £ T . 

Similar to subcase 2a, let S' = {s G S \ {s,n} G J 7 } and T' = {t G T | {t, n} G J 7 }. 
The family J 7 cannot have subsets of forms {s, s', n, . . .}, {V, t', n, . . .}, {£, n, . . .}, and 
{u, v, w,n, . . .} for s G 5, t G T, s' G S", £' G T" and u,v,w G [n]. Then 

^ G ^ U ((f) V {{n}}) U ((f) V {{n}}) U ((((f) V (f)) \ ((f) V (f ))) V {{n}}) U 

>{W})u((T)v{W})- 



S\5' 
2 
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We have 



hjr] < , j + ('?) + (?) + (T)isi + pf)m \ + i5i + in 



+ 



G) (3) ; 
|gim-|giiri + ('Y') + ('T') 

fe n ((7 2 (|S'| + l,|r|))--^-, where 

\3J 






= £i|T'| +£ 2 |S\S / |, with 

= 2(n - 2) _ |r| - 1 _ 5(\S\S'\) = ^1_ \S\-\S'\-\ _ \T\ 

1 3 2 6 2 3 2 6 

If (\S'\,\T\) = (\S\,0) or (0,|T|), then JF c C 2 (SU{n},T) or ^ C C 2 (S,TU {n}). 
Since h n (J-) > 2^-, we have, by relabeling elements of [n], J 7 = C^dSJj [fl)j which is 
another of the alternatives listed in (2). 

Claim 4: When |T'|) ^ 0) or (0, |T|), then # > 1. 

Recall that < \S'\ < \S\ = [^\ and < \T'\ < \T\ = p=±]. 
Suppose n = 2k, so we have \S\ — k — 1 and \T\ — k, k > 2. Rewrite 

£l = (3|T \ T'| + 5|S"|)/6 and e 2 = (|T \ T'\ + 3|S"| + 2)/6. 

Note that \S'\ and |T \ T'\ are not both zero, nor are \T'\ and |5 \ S'\ both zero. One can 
see that e\ and e 2 are each at least 1/2, and so g > 1 unless \T'\ = 1 and \S \ S'\ = 0, or 
\T'\ = and \S \ S'\ = 1. But either pair of conditions increases the e's and still leads to 
<?>!■ 

Else, suppose n = 2k + 1, |,S| = |T| = k and k > 2. This time rewrite 

£l = (3|T \ T'\ + 5|S"| - l)/6 and e 2 = (|T \ T'\ + 3|5"| + l)/6. 

Again it is simple to check that g > 1, and Claim 4 holds. 

From Claim 4, we have if (\S'\, |T'|) 7^ ( | aS' | , ) or (0, |T|), then once again we get the 
contradiction 

- . , \n/2]\n/2\ 1 3 

K{F) < 2 + 1 ' IL '- J - < 2—. 

n(n - 1) (3) 11 

This completes the Case 2 and the proof of the Lemma. □ 
Now we are ready to prove our improved bound. 

Proof of Theorem 12.41 Let J 7 be a D 2 -free family of subsets of [n]. Partition J 7 into 
J-fc and T' such that J-j. contains subsets of sizes in [k,n — k] where k — | — 2\/n Inn, 
and J 7 ' = J-~\ J-fc. We know that (J 7 ']/ (, « J < for large n (see [IT] . Lemma 1). 
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Now concentrate on the family of sets near the middle, T\.. We take what we call the 
min partition of the set C of full chains in B n \ Let be the block containing the full 
chains that do not meet at all. For each subset A £ let Ca be the block containing 
all full chains C having A as the minimal element in Tk H C. We see that the average 
number of times a chain in Ca meets J~k is obtained by considering only the subsets in 
that contain A and viewing them (after removing A from each) as a diamond-free family 
of subsets of [n] \ A containing 0. We deduce that avecec A ( I nC|) < For n large 

enough, we have n — \ A\ > | — 2\/n\nn > 12. Since the 5 n are nonincreasing, we have for 
large n that h(J-'k) < $12 — 2^-. It follows that for sufficiently large n, all U 2 -free families 
T in B n satisfy 

m \ZA±]Z1 <2 — — 
(l?j) (l?j) ~ 11 n2 ' 

Consequently, if it exists, the limit 7r(P 2 ) < 2^-. □ 

6 Further Research 

Beyond diamonds T>k, we continue to investigate why the limit 7r(P) exists for general 
posets P. The methods introduced in this paper have proven to be useful for determining 
7r(P) for several other small posets P, which we are collecting separately [9J HOj [TJ]. One 
example is the subposet of T> 2 consisting of four elements A,B,C,D with B < A 
and B < C < D. Forbidding J is more restrictive than forbidding T>2. We show that 
for n > 1, La(n, J7") = S(n, 2), and hence 7r(j7") = 2. All known values of vr(P) satisfy 
Conjecture 12.21 

In order to resolve the asymptotics for diamond-free posets, and show that tt(V 2 ) = 2 
as expected, it is not enough to work with the Lubell function due to families such as 
those in the constructions Ci described above. These examples show that the terms in 
the sequence 5 n , which was shown to be nonincreasing for n > 2, are at least 2.25 for all 
n > 2. We suspect that the limit lim^oo d n , which is known to exist, is 2.25. This would 
follow from the conjecture below. Here, the conjugate of J 7 is the family J 7 = {F | F £ J 7 } 
where F — [n] \ F is the complement of P. 

Conjecture 6.1 For every n > A, the value /i(P) of any T> 2 -free family T C 2^ satisfies 
- | — | 

h n {T) < 2 + w( - n 4 _ 1 ^ and equality holds if and only if, up to relabelling elements of [n], T 
orT is Ci(S,T) (i = 1,2,3) with \ \S\ - \T\\ < 1. 

Then how might we reduce our upper bound on tt{T>2) (if it exists) to below 2.25? 
The examples above are nowhere near as large as 2^",), yet have very small sets that 
make large contributions to the Lubell function. To build large diamond-free families, we 
should restrict our attention to families with no small nor large sets, say T C Bin, k) with 
k = n — f[n) < n. If we could show that \T\ is at most (2 + o n (l)) (, , for suitable f(n), 

we would have 7r(P 2 ) = 2 as we expect, since most of the 2 n subsets are concentrated 
near the middle rank. 
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Another indication of the challenge facing us for T> 2 is that we have constructed three 
TVfree families for n = 6 of size 36, which is one more than £(6,2). This is in contrast 
to the values k for which Theorem 12.51 determines La(n, T>k) completely, and its value is 
exactly £(n, m). Thus, the solution for V 2 , and probably also T>k for the unsettled values 
k, is likely going to be more complicated. 
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